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The field of complex self-assembly is moving toward the design of multi-particle structures consisting of 
thousands of distinct building blocks. To exploit the potential benefits of structures with such ‘addressable 
complexity,’ we need to understand the factors that optimize the yield and the kinetics of self-assembly. 
Here we use a simple theoretical method to explain the key features responsible for the unexpected success 
of DNA-brick experiments, which are currently the only demonstration of reliable self-assembly with such a 
large number of components. Simulations confirm that our theory accurately predicts the narrow temperature 
window in which error-free assembly can occur. Even more strikingly, our theory predicts that correct 
assembly of the complete structure may require a time-dependent experimental protocol. Furthermore, we 
predict that low coordination numbers result in non-classical nucleation behavior, which we find to be essential 
for achieving optimal nucleation kinetics under mild growth conditions. We also show that, rather surprisingly, 
the use of heterogeneous bond energies improves the nucleation kinetics and in fact appears to be necessary 
for assembling certain intricate three-dimensional structures. This observation makes it possible to sculpt 
nucleation pathways by tuning the distribution of interaction strengths. These insights not only suggest how 
to improve the design of structures based on DNA bricks, but also point the way toward the creation of a 
much wider class of chemical or colloidal structures with addressable complexity. 


Recent experiments with short pieces of single- 
stranded DNAi>2 have shown that it is possible to as¬ 
semble well-defined molecular superstructures from a sin¬ 
gle solution with more than merely a handful of distinct 
building blocks. These experiments use complementary 
DNA sequences to encode an addressable structure^ in 
which each distinct single-stranded ‘brick’ belongs in a 
specific location within the target assembly. A remark¬ 
able feature of these experiments is that even without 
careful control of the subunit stoichiometry or optimiza¬ 
tion of the DNA sequences, a large number of two- and 
three-dimensional designed structures with thousands of 
subunits assemble reliably— The success of this ap¬ 
proach is astounding given the many ways in which the 
assembly of an addressable structure could potentially go 
wrong.— 

Any attempt to optimize the assembly yield or to cre¬ 
ate even more complex structures should be based on a 
better understanding of the mechanism by which DNA 
bricks manage to self-assemble robustly. The existence 
of a sizable nucleation barrier, as originally proposed in 
Refs. 0 and 3, would remedy two possible sources of er¬ 
ror that were previously thought to limit the successful 
assembly of multicomponent nanostructures: the deple¬ 
tion of free monomers and the uncontrolled aggregation 
of partially formed structures. Slowing the rate of nu¬ 
cleation would suppress competition among multiple nu¬ 
cleation sites for available monomers and give the com¬ 
plete structure a chance to assemble before encountering 
other partial structures. Recent simulations of a sim¬ 
plified model of a three-dimensional addressable struc¬ 
ture have provided evidence of a free-energy barrier for 
nucleation,— suggesting that the ability to control this 
barrier should enable the assembly of a wide range of 
complex nanostructures. We therefore need to be able to 


predict how such a barrier depends on the design of the 
target structure and on the choice of DNA sequences. 
Until now, however, there have been no reliable tech¬ 
niques to predict the existence, let alone the magnitude, 
of a nucleation barrier for self-assembly in a mixture of 
complementary DNA bricks. 

Here we show that the assembly of three-dimensional 
DNA-brick nanostructures is indeed a nucleated process, 
but only in a narrow range of temperatures. The nucle¬ 
ation barrier in these systems is determined entirely by 
the topology of the designed interactions that stabilize 
the target structure. Controllable nucleation is therefore 
a general feature of addressable structures that can be 
tuned through the rational choice of designed interac¬ 
tions. We find that the reliable self-assembly of three- 
dimensional DNA bricks is a direct consequence of their 
unusual nucleation behavior, which is not accounted for 
by existing theories that work for classical examples of 
self-assembly, such as crystal nucleation. We are thus 
able to provide a rational basis for the rather unconven¬ 
tional protocol used in the recent DNA-brick experiments 
by showing that they exploit a narrow window of oppor¬ 
tunity where robust multicomponent self-assembly can 
take place. 


STRUCTURE CONNECTIVITY DETERMINES 
ASSEMBLY 

In constructing a model for the self-assembly of ad¬ 
dressable structures, we note that the designed interac¬ 
tions should be much stronger than any attractive in¬ 
teractions between subunits that are not adjacent in a 
correctly assembled structure. The designed interactions 
that stabilize the target structure can be described by a 
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FIG. 1. Controlled nucleation is essential for robust self-assembly, (a) An example 86-strand DNA-brick structure and (b) its 
associated connectivity graph, (c) Incidental interactions between dangling ends, shown in orange, lead to incorrect associations 
between fragments, (d) Free energies of clusters of V subunits with randomly chosen DNA sequences in units of ksT, where &b 
is the Boltzmann constant and T is the absolute temperature. The nucleation barrier, A F*, and the target structure stability, 
A Fg, are strongly temperature-dependent, (e) The equilibrium yield with exclusively designed interactions and the nucleation 
barrier as a function of temperature. Also shown are the target structure stability and the free-energy difference between all 
off-pathway and on-pathway intermediates, in the presence, AF)^, and absence, A F in , of a nucleation barrier. The nucleation 
window is shown in orange, (f) Representative lattice Monte Carlo simulation trajectories with (blue) and without (gray) a 
temperature ramp; a typical malformed structure is shown in the inset, (g) The size of the largest correctly bonded stable 
cluster in lattice Monte Carlo simulations using a temperature ramp (blue) and in constant-temperature simulations initialized 
from free monomers in solution (gray). 


connectivity graph, G, in which each vertex represents a 
distinct subunit and each edge indicates a correct bond. 
This graph allows us to describe the connectivity of the 
structure independently of the geometry and spatial or¬ 
ganization of the building blocks. For structures con¬ 
structed from DNA bricks, the edges of G indicate the 


hybridization of DNA strands with complementary se¬ 
quences that are adjacent in the target structure. An ex¬ 
ample three-dimensional DNA-brick structure is shown 
along with its connectivity graph in Figures [TJi and[T|o. 

In an ideal solution with exclusively designed interac¬ 
tions, the subunits assemble into clusters in which all 
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allowed bonds are encoded in the connectivity graph of 
the target structure. In order to compute the free-energy 
difference between a particular cluster size and the un¬ 
bonded single-stranded bricks, we must consider all the 
ways in which a correctly bonded cluster with a given 
number of monomers can be assembled. These ‘frag¬ 
ments’ of the target structure correspond to connected 
subgraphs of the connectivity graph. In a dilute solu¬ 
tion with strong designed interactions, the numbers of 
edges and vertices are the primary factors determining 
the stability of a particular fragment. We therefore iden¬ 
tify all of the possible on-patlrway assembly intermediates 
by grouping fragments into sets with the same number 
of edges and vertices and counting the total number of 
fragments in each set.— 

This theoretical approach is powerful because it can 
predict the free-energy landscape as a function of the 
degree of assembly between the monomers and the tar¬ 
get structure. Furthermore, the predicted landscape cap¬ 
tures the precise topology of the target structure, which is 
essential for understanding the assembly of addressable, 
finite-sized structures. In the case of DNA-brick struc¬ 
tures, we can assign DNA hybridization free energies to 
the edges of the target connectivity graph in order to de¬ 
termine the temperature dependence of the free-energy 
landscape; for example, Figure [TJi shows the free-energy 
profile of the 86-strand DNA-brick structure with ran¬ 
dom DNA sequences at three temperatures. Our the¬ 
oretical approach allows us to calculate the nucleation 
barrier, A F$, by examining the free energies of clusters 
corresponding to fragments with exactly V vertices. The 
critical number of strands required for nucleation is VU 
transient clusters with fewer than strands are more 
likely to dissociate than to continue incorporating addi¬ 
tional strands. The presence of a substantial nucleation 
barrier therefore inhibits the proliferation of large, par¬ 
tially assembled fragments that stick together to form 
non-target aggregates. 


ASSEMBLY REQUIRES A TIME-DEPENDENT 
PROTOCOL 

Over a significant range of temperatures, we find that 
the free-energy profiles of DNA-brick structures exhibit 
both a nucleation barrier and a thermodynamically sta¬ 
ble intermediate structure. The nucleation barrier is as¬ 
sociated with the minimum number of subunits that must 
be assembled in order to complete one or more cycles, 
i.e. closed loops of stabilizing bonds in a fragment. For 
example, the critical number of monomers in the exam¬ 
ple structure at 319 K, V* = 8, is one fewer than the 
nine subunits required to form a bicyclic fragment of the 
target structure. Under the conditions where nucleation 
is rate controlling, the minimum free-energy structure is 
not the complete 86-particle target structure, but rather 
a structure with only V ~ 65 particles. This incomplete 
structure is favored by entropy, since it can be realized 


in many more ways than the unique target structure. 
Hence, the temperature where nucleation is rate con¬ 
trolling is higher than the temperature where the tar¬ 
get structure is the most stable cluster. The existence of 
thermodynamically stable intermediates is a typical fea¬ 
ture of DNA-brick structures and of complex addressable, 
finite-sized structures in general^ 

This behavior is not compatible with classical nucle¬ 
ation theory (CNT), which predicts that, beyond the 
nucleation barrier, large clusters are always more sta¬ 
ble than smaller clusters. As a consequence, in ‘classi¬ 
cal’ nucleation scenarios such as crystallization, there is 
a sharp boundary in temperature and concentration at 
which the largest-possible ordered structure, rather than 
the monomeric state, becomes thermodynamically sta¬ 
ble. Typically, a simple fluid must be supersaturated 
well beyond this boundary in order to reduce the nucle¬ 
ation barrier, which arises due to the competition be¬ 
tween the free-energy penalty of forming a solid-liquid 
interface and the increased stability due to the growth of 
an ordered structureYet in the case of addressable 
self-assembly, and DNA bricks in particular, a nucleation 
barrier for the formation of a stable partial structure may 
exist even when the target structure is unstable relative 
to the free monomers. 

An experiment to assemble such a structure requires 
a protocol: first nucleation at a relatively high temper¬ 
ature, and then further cooling to complete the forma¬ 
tion of the target structured This behavior can be seen 
in Figure [T^, where we identify a narrow temperature 
window in which there is a significant yet surmountable 
nucleation barrier. Unlike CNT, the nucleation barrier 
does not diverge as the temperature is increased. Instead, 
there is a well-defined temperature above which all clus¬ 
ters have a higher free energy than the free monomers. As 
the temperature is lowered further, the nucleation barrier 
disappears entirely before the equilibrium yield, defined 
as the fraction of all clusters that are correctly assem¬ 
bled as the complete target structure, increases measur¬ 
ably above zero. The equilibrium yield tends to 100% at 
low temperatures, since we have thus far assumed that 
only designed interactions are possible. Therefore, be¬ 
cause of the presence of stable intermediate structures, 
it is typically impossible to assemble the target structure 
completely at any temperature where nucleation is rate 
controlling. 

In order to examine the importance of a nucleation 
barrier for preventing misassembly, we calculate the free- 
energy difference between all off-pathway intermediates 
and all on-pathway intermediates, AF] n , by estimat¬ 
ing the probability of incidental interactions between 
partially assembled structures^ From the connectivity 
graph of the example DNA-brick structure, we can cal¬ 
culate the total free energy of aggregated clusters by con¬ 
sidering all the ways that partially assembled structures 
can interact via the dangling ends of the single-stranded 
bricks, as shown in Figure [TJ;. We also estimate this free- 
energy difference in the case of slow nucleation, AF^, by 
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FIG. 2. Dependence of the nucleation barrier on the coordination number of the target structure, (a) Example structures 
with tetrahedral coordination ( q c = 4), octahedral coordination ( q c = 6) and close-packed coordination ( q c = 12), along with 
their associated connectivity graphs, (b) Free-energy profiles of these three structures at 50% yield assuming identical bond 
energies within each structure. The black line shows the fit of classical nucleation theory to the nucleation barrier of the q c = 12 
structure, (c) The dependence of the nucleation barrier on the total number of strands, V(G), in DNA-brick structures with 
randomly chosen DNA sequences. The nucleation temperature does not increase monotonically with V(G) in these roughly 
cuboidal structures since surface effects are considerable. 


only allowing one of the interacting clusters in a misas- 
sembled intermediate to have V > V*. 

The above analysis supports our claim that a sub¬ 
stantial nucleation barrier is essential for accurate self- 
assembly. Our calculations show that even with very 
weak incidental interactions, incorrect bonding between 
the multiple dangling ends of large partial structures pre¬ 
vents error-free assembly at equilibrium, since A F- m > 0. 
The presence of a nucleation barrier slows the approach 
to equilibrium, maintaining the viability of the correctly 
assembled clusters. 

These theoretical predictions are confirmed by exten¬ 
sive Monte Carlo simulations of the structure shown in 
Figure QJt. In these simulations, the DNA bricks are 
modeled as rigid particles that move on a cubic lat¬ 
tice, but otherwise the sequence complementarity and 
the hybridization free energies of the experimental sys¬ 
tem are preserved^ Using realistic dynamics^ we sim¬ 
ulate the assembly of the target structure using a sin¬ 
gle copy of each monomer. In Figure [If, we compare a 
representative trajectory from a simulation using a lin¬ 
ear temperature ramp with a trajectory from a constant- 
temperature simulation starting from free monomers in 
solution. We also report the largest stable cluster size 
averaged over many such trajectories in Figure [Tf. Nu¬ 
cleation first occurs within the predicted nucleation win¬ 
dow where A F$ ~ 8 k^T. At 319 K, the size of the 
largest stable cluster coincides precisely with the pre¬ 
dicted average cluster size at the free-energy minimum 
in Figure [Tfl. kZ Intermediate structures assembled via a 
temperature ramp continue to grow at lower tempera¬ 
tures, while clusters formed directly from a solution of 
free monomers become arrested in conformations that 
are incompatible with further growth (Figure [If, inset). 


In agreement with our theoretical predictions, the simu¬ 
lation results demonstrate that a time-dependent proto¬ 
col is essential for correctly assembling a complete DNA- 
brick structure. 


COORDINATION NUMBER CONTROLS THE 
NUCLEATION BARRIER 

In the modular assemblies reported in Ref. the max¬ 
imum coordination number of bricks in the interior of the 
structure is four. However, one can envisage other build¬ 
ing blocks, such as functionalized molecular constructs 
or nano-colloids, that have a different coordination num¬ 
ber. To investigate the effect of the coordination number 
on the nucleation barrier, we compare the free-energy 
profile of a 48-strand DNA-brick structure with those of 
two higher-coordinated structures (Figure [2fi): a simple 
cubic structure with coordination number q c = 6 and a 
close-packed structure with q c = 12. (For a discussion of 
two-dimensional structures, see Sec. [S4l) In Figure [5f>, 
we show the free-energy profiles at 50% yield assuming 
identical bond energies within each structure. 

One striking difference between the q c = 4 structure 
and the higher-coordinated examples is the stability of 
the target at 50% yield. In the DNA-brick structure, the 
target structure coexists in nearly equal populations with 
a partial structure that is missing a single cycle. In the 
structures with higher coordination numbers, however, 
the target has the same free energy as the free monomers 
at 50% yield. Intermediate structures are therefore glob¬ 
ally unstable at all temperatures, as predicted by CNT. 

A second point of distinction among these structures 
lies in the relative stability of intermediate cluster sizes. 
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FIG. 3. Random DNA sequences can improve the kinetics of self-assembly, (a) A non-convex, 74-strand DNA-brick structure 
and its associated connectivity graph, (b) Free-energy profiles and effective nucleation barriers, AF , with both random and 
monodisperse designed interaction strengths. The temperatures are chosen such that the mean hybridization free energies 
of these two distributions are equal, (c) Representative lattice Monte Carlo simulation trajectories with heterogeneous and 
monodisperse bond energies. Correct assembly proceeds by assembling one face of the structure at a time, as shown on the 
right. The temperature of the monodisperse system must be lowered to achieve nucleation, but the largest cluster quickly 
becomes kinetically trapped (inset). 


Whereas the DNA-brick structure assembles by com¬ 
pleting individual cycles, the cubic structure grows by 
adding one face at a time to an expanding cuboid*^ With 
q c = 12, the greater diversity of fragments with the same 
number of vertices smooths out the free-energy profile 
near the top of the nucleation barrier. The fitted black 
line in Figure [2h> shows that the assembly of this struc¬ 
ture does in fact obey CNT (see Sec. IS2l) . 


The differences among these free-energy profiles origi¬ 
nate from the topologies of the connectivity graphs of the 
example structures. The most important determinant of 
the nucleation behavior is simply the number of vertices 
required to complete each additional cycle in the target 
connectivity graph, which is controlled by the maximum 
coordination number of the subunitsJ^ Our findings im¬ 
ply that controlled self-assembly of three-dimensional ad¬ 
dressable structures is unlikely to be achieved straightfor¬ 
wardly using subunits with coordination numbers higher 
than four. In higher-coordinated structures, which are 
well described by CNT, it would be necessary to go 
to high supersaturation in order to find a surmount¬ 
able nucleation barrier; however, such an approach is 
likely to fail due to kinetic trapping^ Yet in DNA-brick 
structures, the nucleation barrier is surmountable at low 
supersaturation and is relatively insensitive to the size 
of the target structure (Figure [2jc). The reliable self- 
assembly of large DNA-brick structures is thus a direct 
consequence of the small number of bonds made by each 
brick. 


HETEROGENEOUS BOND ENERGIES IMPROVE 
KINETICS 

Recent publications have argued that equal bond 
energies should enhance the stability of the designed 
structure2i and reduce errors during growth* 22 ' By con¬ 
trast, we find that the kinetics of DNA-brick assembly are 
actually worse if one selects DNA sequences that mini¬ 
mize the variance in the bond energies. Our observation 
is consistent with the successful use of random DNA se¬ 
quences in the original experiments with DNA bricks iS 
Here again, the nucleation behavior is responsible for this 
unexpected result. 

To demonstrate the difference between random DNA 
sequences and sequences chosen to yield monodisperse 
bond energies, we consider the relatively simple non- 
convex DNA-brick structure shown in Figure [3Ji. This 
74-brick structure, constructed by removing the interior 
strands and two faces from a cuboidal structure, assem¬ 
bles roughly face-by-face when using random DNA se¬ 
quences. The relevant nucleation barrier, as predicted 
theoretically in Figure [3jo and confirmed with Monte 
Carlo simulations in Figure [3J;, is the completion of the 
third face. With monodisperse bond energies and an 
equivalent mean interaction strength, a much larger nu¬ 
cleation barrier appears before the first face forms. At¬ 
tempts to reduce this nucleation barrier by increasing the 
mean bond energy result in kinetic trapping and arrested 
growth. Despite promising fluctuations in the largest 
cluster size in the simulation trajectory with monodis¬ 
perse energies, multiple competing nuclei appear, and the 
largest cluster remains poorly configured for further as¬ 
sembly (Figure [3fc, inset). 
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The use of sequences with a broad distribution of hy¬ 
bridization free energies results in a more suitable nucle- 
ation barrier because such a distribution selectively sta¬ 
bilizes small and floppy intermediate structures. This is 
a statistical effect: since there are far fewer ways of con¬ 
structing a maximally connected fragment with a given 
number of monomers, the chance that randomly assigned 
sequences concentrate the strongest bonds in a compact 
fragment is vanishingly small in a large structure. As a 
result, the dominant nucleation pathways no longer need 
to follow the maximally connected fragments. The use 
of a broad distribution of bond energies therefore tends 
to reduce nucleation barriers, since unstable fragments 
near the top of a barrier contain fewer cycles and are 
thus affected more significantly by the variance in the 
bond-energy distribution. 


DESIGN PRINCIPLES FOR (DNA) NANOSTRUCTURES 

The insights provided by our predictive theory allow 
us to understand the general principles underlying the 
unexpected success of DNA-brick self-assembly. Slow, 
controlled nucleation at low supersaturation is achieved 
for large structures since each brick can only make a small 
number of designed connections. Because of an apprecia¬ 
ble nucleation barrier that appears in a narrow tempera¬ 
ture window, monomer depletion does not pose a signifi¬ 
cant problem for one-pot assembly. Surprisingly, complex 
structures with randomly selected complementary DNA 
sequences experience enhanced nucleation, making larger 
intermediate structures kinetically accessible at higher 
temperatures. 

The use of a temperature ramp plays a more crucial 
role than previously thought. Cooling the DNA-brick 
solution slowly is not just a convenient way of locating 
good assembly conditions, as in the case of conventional 
crystals; rather, it is an essential non-equilibrium proto¬ 
col for achieving error-free assembly of finite-sized struc¬ 
tures. The explanation of slow nucleation and fast growth 
that was originally proposed in Refs. H] and @ is therefore 
incomplete: fast growth allows the DNA bricks to assem¬ 
ble into a stable, on-pathway intermediate that must be 
annealed at lower temperatures to complete the target 
structure. Remaining out of equilibrium throughout the 
assembly protocol, as is necessary in order to avoid the 
aggregation of partial structures, relies on the slow diffu¬ 
sion of large intermediates. This is a reasonable assump¬ 
tion, since the rate of diffusion changes approximately 
inversely with the radius of a fragment in solution^ 

Our approach also suggests how to improve the design 
of DNA-brick nanostructures beyond the random selec¬ 
tion of uniformly distributed DNA sequences. For a given 
target structure, it is easy to tune the nucleation barrier 
by adjusting the statistical distribution of bond energies. 
Complementary DNA sequences can then be assigned to 
the structure in order to achieve the desired distribution 
of hybridization free energies. Furthermore, with an un¬ 


derstanding of the origin of the nucleation barrier in a 
particular structure, it is possible to optimize the an¬ 
nealing protocol rationally in order to increase the yield 
of the target assembly. Our approach also provides a 
means of systematically investigating how local modifi¬ 
cations to the coordination number through the fusing of 
adjacent strands affect the nucleation behavior of DNA- 
brick structures^ 

The theoretical method used here greatly simplifies 
the quantitative prediction of nucleation barriers and in¬ 
termediate structures with widespread applications for 
controlling the self-assembly of biomolecular or synthetic 
building blocks. Addressable self-assembly holds great 
promise for building intricate three-dimensional struc¬ 
tures that are likely to require optimization on a case- 
by-case basis. Because our predictive theory is sensi¬ 
tive to the details of a particular target structure, per¬ 
forming these calculations for nanostructures of experi¬ 
mental interest will enable the precise engineering of as¬ 
sembly properties at the design stage. In order for po¬ 
tential users to perform such experimental protocol de¬ 
sign, we provide a user-friendly software package online 
at https://github.com/wmjac/pygtsa. 
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METHODS 

DNA hybridization free energies 

We compute the hybridization free energies of com¬ 
plementary 8-nucleotide DNA sequences using established 
empirical formula o 25126 assuming salt concentrations of 
[Na + ] = 1 mol dm -3 and [Mg 2+ ] = 0.08 mol dm -3 . For the 
calculations with monodisperse bond energies, we use the se¬ 
quences provided in Ref. [2*3. The strengths of incidental inter¬ 
actions are estimated based on the longest attractive overlap 
for each pair of non-complementary sequences. In calculations 
of the equilibrium yield and free-energy profiles, we report the 
average thermodynamic properties using 1000 randomly cho¬ 
sen complete sets of DNA sequences. See Sec. IS5I for further 
details. 

Lattice Monte Carlo simulations 

Constant-temperature lattice Monte Carlo simulations are 
carried out using the virtual move Monte Carlo algorithmic 
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in order to produce physical dynamics. Rigid particles, each 
with four distinct patches fixed in a tetrahedral arrangement, 
are confined to a cubic lattice. A single copy of each required 
subunit is present in the simulation box with 62 x 62 x 62 
lattice sites. Complete details are given in Ref. @. For com¬ 
parison with the results of these simulations, the theoreti¬ 
cal calculations reported here assume the same dimensionless 
monomer concentration, p = 62~ 3 , lattice coordination num¬ 
ber, q c = 4, and fixed number of dihedral angles, qa = 3 (see 
Sec. ED). 
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SUPPLEMENTARY INFORMATION 


SI. THEORETICAL FREE-ENERGY AND 
EQUILIBRIUM YIELD CALCULATIONS 

We construct the connectivity graph, G, from the de¬ 
signed bonds between adjacent subunits in the target 
structure. An example connectivity graph is shown in 
Figure |Tj>. From this graph we are able to determine all 
the relevant thermodynamic properties of the interme¬ 
diate and target structures in a near-equilibrium assem¬ 
bly protocol. A thorough explanation of this theoretical 
method is presented in Ref. 10; here we summarize the 
key equations. 

The connected subgraphs (‘fragments’) of the target 
connectivity graph are grouped into sets, h(E,V), in 
which all fragments have precisely E edges and V ver¬ 
tices. Assuming that subunits can only form clusters 
with designed bonds, the dimensionless grand potential, 
— In S, can be written in terms of a sum over all sets of 
fragments, 

Z id = lnE = ^2\h(E,V)\z E ,v. (1) 

E,V 

The average fugacity of the fragments in each set, z E y, 
depends on the topologies of the fragment graphs as well 
as the geometry of the subunits and the solution con¬ 
ditions. Ignoring excluded volume interactions, we can 
approximate In z E ,v as 

In z E y = EpeEy + V\np 

-(V-l)lnq c -(V-B E ,v-l)\nq d , (2) 

where fee In q c is the rotational entropy of a monomer, 
ks In qd is the dihedral entropy of an unconstrained 
dimer and p is the dimensionless concentration. The 
mean dimensionless dihedral entropy of a fragment is 

Be v l n 9d = ln/qf^ ) , where B(g) is the num- 

\ / g£h(E,V) 

ber of bridges^ in the fragment g. The exponentially 
weighted mean bond energy within each set, /3e E ,v, is 

/3e E y = ( ln ( exp ^ f3e b \ V (3) 

\ \ be£( g ) l g&h{Ey) / seq 

where e& is the hybridization free energy of bond b , 
/3 = 1/£;bT is the inverse temperature and £(g) is the 
edge set of fragment g. The inner average runs over 
all fragments in the set h(E , V) with quenched random 
bond energies {e&}. In the case of DNA-brick structures, 
the outer average samples DNA sequences so that each 
complete set of bond energies for the target structure is 
chosen independently from the same distribution of hy¬ 
bridization free energies. 

The free energy of a correctly bonded cluster of V 
monomers is thus 

PF(V) = -lnJ2\h(E,V)\z E y, (4) 

E 


since we do not distinguish among equally sized clusters 
with varying compositions. This definition is appropriate 
for studying nucleation, as any subset of monomers has 
the potential to serve as a nucleation site. 

The equilibrium yield, 77 , is defined as the fraction of 
all clusters in solution that are correctly formed, 


= (Ag) _ Zg_ 

V ~Eg(Ng) _ Z id ’ 


(5) 


where (Nq) is the grand-canonical average number of 
copies of fragment g in solution and zq is the fugacity of 
the target structure. In the case of structures with higher 
coordination numbers, a few edges may be removed from 
the connectivity graph without allowing any subunit to 
disassociate or rotate. For these structures, we replace zq 
in Eq. © with a sum over the fugacities of all fragments 
that enforce the correct geometry of the target structure. 


S2. CLASSICAL NUCLEATION THEORY 

Classical nucleation theory predicts a free-energy bar¬ 
rier for the nucleation of a stable, ordered structure from 
an unstable, fluid phase. The height of the barrier and 
the size of the critical nucleus vary with the degree of su¬ 
persaturation of the fluid phase ; 13 ’ 14 Assuming spherical 
nuclei, the classical prediction for the free-energy differ¬ 
ence between a nucleus of the ordered phase containing 
V monomers and the bulk fluid phase is 

A F(V) = -ApV + 1 (36np- 2 d ) 1/3 (V-V 0 ) 2/3 , ( 6 ) 

where A p is the bulk free-energy difference per particle 
between the fluid phase and the ordered phase, 7 is the 
free-energy cost per unit area of forming an interface be¬ 
tween the two phases and p 0 rd is the number density of 
the ordered phase. In Figure [2):>, the black line shows the 
fit to Eq. © with Vq = 1. 


S3. CONCENTRATION DEPENDENCE 

In the main text, we report all results of the theoretical 
calculations assuming a dimensionless concentration of 
62 -3 for comparison with the lattice Monte Carlo simu¬ 
lations. Changing this concentration shifts both the equi¬ 
librium yield and the nucleation barrier linearly with ln p. 
The concentration dependence of the nucleation barrier 
of the example 86 -strand structure at several tempera¬ 
tures is shown in Figure [Si] Although we have assumed 
equal concentrations of all monomers, polydispersity in 
the monomer concentrations can be easily incorporated 
into the theoretical treatment in much the same way as 
the distributions of designed interaction energies. 
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FIG. SI. The dependence of the height of the nucleation 
barrier, A F*, on the dimensionless concentration, p, for the 
example 86-strand structure shown in Figure [TJr. 


S4. ASSEMBLY OF TWO-DIMENSIONAL 
STRUCTURES 

We find that the shapes of the free-energy profiles of 
two-dimensional structures are also strongly affected by 
their coordination numbers. In Figure IS21 we show the 
free-energy profiles at 50% yield of two similarly sized 
two-dimensional structures with coordination numbers 
q c = 3 and q c = 4. The behavior of the two-dimensional 
structure with q c = 3 is similar to that of the four- 
coordinated three-dimensional structures examined in 
the main text. The two-dimensional structure with 
q c = 4 exhibits the same face-by-face assembly as three- 
dimensional structures with octahedral coordination; in 
the two-dimensional case, however, coexistence at 50% 
yield occurs between the target structure and fragments 
with the same number of monomers but fewer bonds. 

Two-dimensional DNA-tile structures with q c = 4 have 
been successfully assembled^ In general, the nucleation 
barriers in two-dimensional structures are much lower 
than in three-dimensional structures with a similar num¬ 
ber of monomers. Consequently, a lower supersaturation 


is required in order for the target structure to become 
kinetically accessible, and kinetic trapping is therefore 
less likely to interfere with accurate assembly. Never¬ 
theless, these calculations suggest that lower coordinated 
two-dimensional structures, such as the hexagonal lattice 
pictured in Figure 152k . might assemble more robustly in 
experiments. 


S5. HYBRIDIZATION FREE-ENERGY DISTRIBUTIONS 

DNA hybridization free energies are strongly 
temperature-dependent i 25 ’ 26 In Figure IS3k. we compare 
the two hybridization free-energy distributions used 
in the main text. The mean and the variance of the 
hybridization free energies of 8-nucleotide sequences are 
shown for both the case of randomly chosen sequences 
and the case of sequences selected to yield monodisperse 
bond energies^ Designed interactions occur between 
complementary sequences, while incidental interactions 
are calculated based on the most attractive overlapping 
regions of two non-complementary sequences. 

In the lattice Monte Carlo simulations, all monomers 
on adjacent lattice sites experience a weak 100 K re¬ 
pulsion at all temperatures. In calculations involving 
designed interactions, this repulsion is subtracted from 
the mean interaction strength. When estimating inci¬ 
dental interactions, we ignore associations between non- 
complementary sequences that have a maximum attrac¬ 
tive interaction of less than 100 K. The means and vari¬ 
ances reported in Figure IS3k are thus calculated based 
on the fraction of pairs of non-complementary sequences 
that attract more strongly than 100 K; this fraction is 
shown in Figure IS3b. 

These incidental interaction distributions are clearly 
approximate and are defined in order to match the lat¬ 
tice Monte Carlo simulations. Nevertheless, the choices 
made in defining these distributions have a negligible ef¬ 
fect on the calculated values of Zy^ and are irrelevant 
to the prediction of nucleation barriers and equilibrium 
yields. In order to apply this theoretical method to an ex¬ 
perimental system, the designed interaction distributions 
should be recalculated in accordance with the experimen¬ 
tal solution conditions. 
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FIG. S2. Dependence of the nucleation barrier on the coordination number of two-dimensional structures, (a) Connectivity 
graphs of two-dimensional structures with coordination numbers q c = 3 and q c = 4. (b) The free-energy profiles of these 

structures at 50% yield assuming identical bond energies within each structure. For both structures, we assume q<\ = 2. 
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FIG. S3. DNA hybridization free-energy distributions, (a) The mean and variance of the hybridization free-energy distributions 
with randomly chosen DNA sequences (left) and with sequences selected to yield monodisperse bond energies (right). Designed 
interactions refer to complementary DNA sequences, while incidental interactions report the hybridization free energy of the 
most attractive overlapping region of two non-complementary sequences. Note that the means of the designed interactions 
of these two distributions are shifted by approximately 10 K. (b) The fraction of pairs of non-complementary sequences that 
attract more strongly than 100 K. 
















